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Abstract 

We describe a combinatorial approach for investigating properties of 
rational numbers. The overall approach rests on structural bijections be- 
tween rational numbers and familiar combinatorial objects, namely rooted 
trees. We emphasize that such mappings achieve much more than enu- 
meration of rooted trees. 

We discuss two related structural bijections. The first corresponds to a 
bijective map between integers and rooted trees. The first bijection also 
suggests a new algorithm for sifting primes. The second bijection extends 
the first one in order to map rational numbers to a family of rooted trees. 
The second bijection suggests a new combinatorial construction for gen- 
erating reduced rational numbers, thereby producing refinements of the 
output of the Wilf-Calkin[l] Algorithm. 

1 The Combinatorics 

The word " Combinatorics " here mostly refers to the combinatorics of trees, 
more specifically labeled rooted trees see Fig[l]. The defining property of a tree 
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Figure 1: Typical example of a rooted tree. Root node: r, leaves: {vi, 1)3,1)5}, 
(parent, child) pair: (1)2,1)3), connected path: r — > V2 — > «4 — > 1)5. 



Figure 2: A valid label assignment with labels = 2,3,5 and 7. 




Figure 3: An invalid label assignement with labels = 2,3,5 and 7. 

is that precisely one path connects any two vertices of the tree. At various parts 
of the discussion we will assign labels to the vertices of the trees. Rooted trees 
have each a special vertex (typically labeled r) referred to as the root of the tree. 
Moreover a collection of rooted trees is called a planted forest. For convenience 
in the subsequent discussion the word "tree" will refer to a rooted tree while 
the word "forest" will refer to a planted forest. 

Each vertex v ^ root , is adjacent to a single vertex whose distance to the root 
is smaller than the distance between v and the root. (The distance between 
two vertices here refers to the number of edges on the unique path connecting 
the two vertices.) Such a vertex is called the parent of v and all other vertices 
adjacent to v are called children vertices of v. 

The analogy to human family relations is extended to include notions such as 

siblings and grandparent relationships between vertices. We assume that such 

relationships between vertices are unambiguous within the current context of 

trees. Vertices with no children are called leaves of the tree. 

Finally, a label assignment to the vertices of a given tree is considered valid, if 

the root is labeled r and no two sibling vertices are assigned the same label. See 

Fig[2,3]. 

1.1 Operations on trees and forests 
1.1.1 Grafting trees and forests 

The first operation on trees that we introduce here is the grafting operation. 
Let T a and Tp denote two rooted trees. We say that the tree T 7 results from 
the grafting of T a onto Tp denoted 

— T a A Tp (1) 
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Figure 4: Grafting operation on trees. 



iff T a and Tp are disjoint rooted subtrees of T 7 and most importantly T 1 has 
no other rooted subtree disjoint from the subtrees T a and Tp as illustrated in 
Fig[4]. 

Note that, it follows from the definition that the grafting operation is commu- 
tative 

T a A Tp = Tp A T a . (2) 

The single- vertex rooted tree (having only the root vertex), can be thought to 
represent the neutral (or identity) element for the grafting operation. We write 

root A root = root (3) 

and 

T a A root = root A T a = T a (4) 

The tree grafting operation induces a natural algebra on forests described below. 
Let T = {Tk} 1<k<n and Q — {T[} 1<l<m denote two forests. The grafting of 
the forest T onto the forest Q amounts to creating a new forest made of trees 
resulting from the grafting of all possible pairs of trees (T„, T^) such that 

(T M ,T:)6Jx5. (5) 

Hence, 

T t\Q = {Tfc A T/} (6) 



1.1.2 Raising forests 

Let T — {Tk} 1<k<n denote a forest and T a denote a tree. The operation of 
raising the forest T by the tree T a noted 

1l(T a ,T) (7) 

consists in substituting every tree T k in the forest T with an extended tree T' k 
constructed by rooting the tree T k at every leaf of T a as illustrated in Fig[5]. 
We note that the raising operator is not commutative. It also follows from the 
definition that for any tree T a we have 

1Z (root, T a ) = root. (8) 

Finally, for convenience we adopt the convention that 

K{T a ,root) =T a . (9) 



3 



Figure 5: Raising operation on forests. 



1.2 Generating the forest of all labeled trees of height 
bounded by h using n labels. 

We recall that a label assignment to vertices of a tree is considered valid if the 
root is unlabeled and no two sibling vertices of the tree are assigned the same 
label. We assume here that the label set is of size n, and furthermore, if we 
restrict our attention to the trees of height bounded by a positive integer h, it 
is clear that the set of such trees is finite. 

We now describe a construction which produces the set of trees described above 
as a forest. For this purpose we consider the following recursion. 

G = {root} (10) 

and 

Gi= /\ {root, 1Z (Tk, Gi-i)} , (11) 

0<k<n-l 

that is, 

d = {root, K {To, G,_i)} A • • • A {root, TZ (T n _i, (12) 

where Tk denotes the tree made of the root and an additional vertex who is 
assigned the k th label from our labeling set L = {0,1, ■ ■ ■ ,(n— 1)}. 
For illustration purposes we express G\ 

Gi = f\ {root, T k } . (13) 

0<fe<n-l 

= {root, T } A ■ • • A {root, T„_i} . (14) 

By construction we have that for 1 < i < h the trees in the forest Gi are distinct, 
validly labeled and of depth bounded above by i. Furthermore 

G s £ G s+1 (15) 

It also follows from the definition of the recurrence that Gi has 2™ elements and 
that the number of trees in the forest Gi is prescribed by the recurrence relation 

\Gi\ = (l + |G,_i|) n (16) 
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Theorem 1: The forest Gh contains all trees with valid vertex label assignment 
and of height bounded by h. 

Proof : The theorem is proved by first observing that the validly labeled 
trees of height bounded by h correspond to rooted subtrees of the complete 
n-ary tree of height h. A simple counting argument reveals that the number of 
such trees is determined by the recurrence relation 

So = 1 (17) 

and 

VI < i< h, Si = (l + ^_i) n (18) 

where Si denote the number of distinct rooted subtrees of the complete n-ary 
trees with height bounded by i. We therefore conclude the proof by observing 
that 

VO <i < h, \Gi\ = Si. □ (19) 

2 Applications to Number Theory 

We discuss here the structural bijection between the set of integers and the set 
of valid prime-labeled trees. The mapping of an arbitrary valid tree (labeled 
with primes) to an integer, is established by equating the sibling relationship 
between vertices to integer multiplication and also equating for non-root vertices 
the parenthood relation to integer exponentiation. The process is therefore 
algorithmic and will be referred to as evaluation of a tree to an integer. We may 
point out that the evaluation process is recursive and insist that the evaluation 
be always initiated at the leaves because of the non-associativity of integer 
exponentiation. 

On the other hand the mapping of an arbitrary integer to a valid prime-labeled 
tree immediately follows from recursively applying the fundamental theorem of 
arithmetics to the powers of the prime factors. The single vertex tree (only 
having the root as a vertex) is associated to the integer 1. It therefore follows 
that the mapping between trees and integers is bijective. 

2.1 Combinatorial Prime Sieve Algorithm 

Modern sieve theory focuses on providing accurate estimates for the number of 
primes in some integer interval. In contrast, as an application of the combina- 
torial framework described above we discuss a variation of the classical sieve of 
Eratosthenes. We recall some notation convention, 
[n] denote the set of consecutive integers from 1 to n, 
P denotes the set of prime numbers 
N denotes the set of strictly positive integers. 
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Algorithm Combinatorial Prime Sieve 
Input: For q G P the set P n [q] 
Output: The set P D ([2g]\[g]) 
1- G out <- {0}(*Output list*) 

2. G res t <— {0}(*restricted list of composites*) 

3. G u {0}(*old extended list of integers*) 



4. G new 4 Ao<fc<n— i { roo ^?-^fc} (*new extended list of integers*) 

5. while (3T G (G n ew\G id) s.t. evaluation(T) < 2q ) 

6. do G i d <- G 

new 

7. G 

new 

*~ Ao<fe<n-i ( ro °t>'R-(Tk,Goid)) 

8. 

9. for i€ [l,...,|(G^»\Gow)|] 

10. do 

11. if (q < evaluation((Gnew\G id) [i]) < 2q) and {(G new \G id) [i] G rest ) 

12. (*found a new composite*) 

13. then G reS fOrderedInsert{(G new \G id) [*]} ( *added the 

new composite to restricted the list*) 

14. for i e [l,...,\G rest \ - 1] 

15. do 

16. if (evaluation(G res t[i + 1]) == evaluation(G res t[i}) + 2) 
(*found a new prime*) 

17. then G ou t G ou t.Append{emZiiai«on(G res t[i]+l)} (*add 

it to the list*) 

18. 

19. return G ou t 



Theorem 2.1. G P the algorithm above will produce all the primes in the 
range (q, 2q) 

Proof. The correctness of the algorithm follows from the well established fact 
that for all n > 1, there exist some prime number p with n < p < 2n [2], in 
addition to two other facts. The first of which is that in the range (q, 2q) there 
is no composite whose corresponding tree has a vertex labeled with a prime 
greater than q. The reason being that 2q is the largest of the composite less 
than or equal to 2q, in other words the smallest composite admit ing a prime 
p greater than q as a vertex label in the associated tree would be of the form 
2p and consequently necessarily greater than 2q. The second fact is that all 
the composites whose corresponding trees are labeled with primes less than or 
equal to q are determined by the combinatorial algorithm described in section 
1.2. Furthermore, the sought-after primes in the range (q, 2q) will be uncovered 
by identifying trees in the resulting forest which evaluate to nearest composites 
whose difference equals 2. □ 

2.2 Re-enumerating the rationals 

Georg Cantor was the first to establish the suprising fact that the set Q + is 
countably infinite. In pQ Neil Calkin and Herbert Wilf introduced a sequence 
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Figure 6: tree T k . 



Figure 7: tree T k -i. 



which listed the elements of Q + so as not to include duplicates in the sequence. 
We discuss here another sequence for listing the rationals so as not to include 
duplicates. Our sequence follows from the integer combinatorial encoding de- 
scribed above. 

The proposed construction for listing elements of Q + has the benefit of 
providing explicit control over the subset of the prime numbers used to express 
the rationals in the sequence. 

Let Tfc denote the tree made of a root vertex and an additional vertex labeled 
with the k th prime (see Fig[6]), and let T k -i denote the tree made of a root vertex 
and an additional vertex labeled with the inverse of the k th prime (see Fig[7]). 
Building on the recurrence for Gi discussed in 1.2 we write the recurrence 

G = {root} (20) 

and 

Gi= /\ {root, ^(T fc ,Gi_i)}. (21) 

feeN 

We use the recurrence for d to create a new Hi recurrence defined by 

Hi= /\{K(T k -i,Gi),root,1l(T k ,Gi)}. (22) 

feeN 

We also have 

Hi £ H i+1 . (23) 

The trees in the forest Hi evaluate to distinct rational numbers in reduced form 
and the rooted trees in the forest Hi all have depth less than or equal to i. The 
trees which evaluate to non-integers have special vertices that are prime-inverse 
labeled and attached to the root. Furthermore, it follows from the fact that 
siblings have different labels that corresponding rational numbers are in their 
reduced form. 

Hence as a corollary of Theoreml the trees in forest H^ bijectively maps to 
Q+. 
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